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Abstract 

The determination of the primary energy and mass of ultra-high-energy cosmic-rays (UHECR) generating extensive 
air-showers in the Earth's atmosphere, relies on the detailed modeling of hadronic multiparticle production at center- 
of-mass (cm.) collision energies up to two orders of magnitude higher than those studied at particle colliders. The 
first Large Hadron Collider (LHC) data have extended by more than a factor of three the cm. energies in which 
we have direct proton-proton measurements available to compare to hadronic models. In this work we compare LHC 
results on inclusive particle production at energies ^fs - 0.9, 2.36, and 7 TeV to predictions of various hadronic Monte 
Carlo (MC) models used commonly in cosmic-ray (CR) physics (qgsjet, epos and sibyll). As a benchmark with a 
standard collider physics model we also show pythia (and phojet) predictions with various parameter settings. While 
reasonable overall agreement is found for some of the MC, none of them reproduces consistently the evolution of 
all the observables. We discuss implications of the new LHC data for the description of cosmic -ray interactions at the 
highest energies. 



1. Introduction 

In astroparticle physics, the identification and understanding of the sources of high-energy cosmic rays is one 
of the most important open problems. There is an increasing observational support for the hypothesis of particles 
being accelerated at shock fronts in supernova remnants. These cosmic rays are expected to populate the observed 
cosmic -ray spectrum up to 10'^ - 10'^ eV UJ. The sources of cosmic rays beyond that energy are not known and, due 
to the extreme requirements needed to reach such energies in acceleration processes, subject to various speculations 
including extensions of the Standard Model of particle physics |l2|,(3l|4ji]. 

A compilation of recent flux measurements of cosmic rays is shown in Fig.[T] The power law of the energy spec- 
trum of cosmic rays changes at about 10'^'^ eV. This feature is known as the knee, the origin of which is still under 
debate. Theoretical explanations have been put forward based on a change of slope in the source spectra, effects of 
leakage from the Galaxy, the assumption of changes in hadronic interactions or the production of exotic new parti- 
cles (|6[]. The ankle in the energy spectrum at ~ 10"^^ eV is often assumed to be the imprint of the change from galactic 
to extragalactic sources 10, [H] or, alternatively, a signature of the propagation of extragalactic cosmic rays through the 
microwave background radiation jsi IqI, [loll . Finally the suppression of the flux at ultra-high energies could be due 
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Figure 1: Data on the all-particle flux of cosmic rays. The flux has been scaled by E--^ to make the features clearly visible. The axis at the top 
indicates the equivalent cm. energy if the cosmic ray particles were protons. The nominal collider cm. energy for different accelerators is also 
shown. References to the data can be found in [Sj]. 



to the Greisen-Zatsepin-Kuzmin (GZK) energy loss effect Ol ll 11211 or to the fact that the sources have reached their 



maximum energy, or to a combination of both. Indeed, the most powerful astrophysical objects are expected to be 
able to accelerate particles up to a maximum energy just around 10^" eV 11311 . Models based galactic sources, which 
thus elude the GZK suppression, have also been proposed based on UHECR emission from neutron stars lll4 11511 or 
previous Gamma Ray Bursts In such cases the suppression of the spectrum would be related to the maximum 

energy that the sources can accelerate particles to. 



Knowing the elemental composition of cosmic ray particles arriving at Earth is of crucial importance to under- 
stand the production and propagation of cosmic rays. Unfortunately, cosmic rays can be measured only indirectly 
above an energy of 10'^ eV through the cascades of secondary particles, called extensive air-showers (EAS), that they 
produce in the atmosphere (for a recent review, see [5]). Only by simulating the generation of EAS and comparing 
the predictions with measurements one can draw conclusions on the primary mass composition of the arriving parti- 
cles lIlSll . With the operation of modern large-scale experiments the reliability of air-shower simulations has become 
the source of the largest systematic uncertainty in the interpretation of cosmic -ray data (l9 . 23, 21 , 22 , 23 , 24 j . While 
the electroweak interaction processes are reasonably well understood, modeling of hadronic multiparticle production 



is subject to large theoretical uncertainties that are difficult to estimate 1I25L 1261 12711 



In this context it is not surprising that in some speculative scenarios the knee in the cosmic-ray spectrum has been 
related to a change in the characteristics of hadronic interactions. In an air-shower only a part of the energy of the pri- 
mary particle is transferred to electromagnetic particles and low-energy muons that can be detected easily. Assuming 
that, for example, a rapidly increasing fraction of very high energy muons is produced in interactions just above the 
knee energy, the non-detection of the energy of these muons would result in a systematically incorrect reconstruction 
of the primary energy and lead to a knee in the observed spectrum even if there is not such a break in the primary CR 
flux (see, for example, L28,2£.^i31i]). These models could not be consti-ained by Tevatron data as the 2 x 10'^ eV 
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equivalent (fixed-target) energy of the Tevatron collider is below the knee. 



The Large Hadron Collider (LHC) at the CERN laboratory allows us to access for the first time the energy region 
above the knee in the laboratory. Therefore an analysis of inclusive particle data taken at the LHC is particularly 
interesting for constraining existing hadronic interaction models and for testing possible new mechanisms of hadron 
production |32J. Data from LHC experiments published so far have mostly been taken with detectors covering the 
central phase space region in pseudorapidity (|?7| < 2.5). This region is most easily accessible in collider experiments 
and is also the region of the highest rapidity-density of produced particles. On the other hand, since the number of 
particles in an air-shower is roughly proportional to the energy of the primary particle, the most energetic outgoing 
particles of an interaction, emitted in the very forward region of a collider experiment - such as in diffractive in- 
teractions - are the most important ones for understanding air-showers. While waiting for the results from forward 



detectors at the LHC 11331 l34i 13511 . important constraints can be derived from the already measured central particle 
production. 

In this paper we compare the predictions of several representative hadronic interaction models with single-particle 
inclusive observables measured at midrapidity at the LHC. We focus on the following observables: (i) the pseudo- 
rapidity density of charged particles at midrapidity dNch/dri\,j=o, (ii) the average transverse momentum of hadrons 
and (iii) the event-by-event distribution of the charged multiplicity, P{Nch)- Implications of the data-theory 
comparisons are discussed and further observables of direct relevance to EAS simulations, measurable with central 
and forward detectors at the LHC, are pointed out. 

The paper is organized as follows. In Section |2] we recall the main theoretical ingredients of the hadronic in- 
teraction models used in our study and discuss their differences. In Section [3] we summarize the recently collected 
inclusive charged hadron data at the LHC from the ALICE Ii36l.[37l] an d CMS f38,. 39] experiments as well as results 
from previous colliders (ISR H, UAl UA5 IH and CDFlfilH'). [Unfortunately, we could not include in 



this study the latest results from the ATLAS experiment (43\, which appeared public only when this analysis was 
being finished.] We discuss the different event trigger selection criteria applied in each experiment as those affect 
somewhat the data-model comparisons. In Section |4] we compare the measurements to the various event generators, 
and discuss the results in the context of high energy cosmic -ray interactions with the atmosphere in Section |5] We 
summarize our main findings in Section|6l 



2. High-energy hadronic interaction models 

Calculating predictions for the bulk of produced particles in hadronic interactions is not possible yet within first- 
principles Quantum-Chromodynamics (QCD). One has to resort to phenomenological models that combine funda- 
mental principles of quantum field theory - such as unitarity, analyticity and crossing - together with perturbative 
QCD (pQCD) predictions including phenomenological fits (e.g. accounting for the parton-to-hadron fragmentation) 
to experimental hadron spectra. 



General-purpose hadronic interaction models used in high-energy physics (HEP), such as pythia [46], herwig |147 1 
and SHERPA [48], are developed to learn and interpret the data measured in accelerator experiments with an emphasis on 
hard-scattering measurements (signals and backgrounds) rather than on the bulk of hadron production at lower trans- 
verse momenta. The predictions of models can be adjusted by a large number of parameters that encode fundamental 
physics quantities on the one hand, and phenomenological assumptions or simplifications on the other These models 
are typically optimized for processes that can be calculated in perturbation theory and other assumptions needed for 
generating complete final states for hadronic interactions are often kept at a more simple level. In addition, these 
models are mainly designed for proton, pion or photon interactions and not suited for nuclear interactions relevant for 
CRs collisions with air nuclei in the upper atmosphere. 

In contrast, interaction models commonly used in cosmic -ray physics such as qgsjetOI qgsjetII isi 



53(1 and sibyll ll54il55Ll56ll . are supposed to predict hadronic interactions as realistically as possible with the emphasis 
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on reproducing existing accelerator measurements and providing a reasonable extrapolation to higher energy and to 
phase-space regions where no data are available. On the one hand, there is typically only one "optimal" parameter set 
for each of the models and simplifications are made in the implementation of known QCD predictions, in particular, 
if they are not directly relevant to the production of high energy secondaries. On the other hand, more sophisticated 
models of soft particle production are implemented and great care is given to the relation of the total, elastic and 
inelastic cross sections to particle production. 

In between these two generic categories there are models such as phojet |'57','58','59']/ DPMJET and EPOS f^^, 

which are designed to be more universal and approach the sophistication of HEP models regarding some aspects of 
hard processes. At the same time the parameters of these models are tuned to describe a large set of accelerator data 
at various energies and are not supposed to be changed for tuning for different experiments. 

In the following we will consider the models dpmjet, qgsjetOI and II, sibyll 2.1, and epos vl.99 which are often 
used in cosmic -ray simulations ifisl lsll and the pythia Monte Carlo versions 6.4 and 8 as a benchmark HEP model. 
The PYTHIA predictions are shown mainly for reference as this model is used in all publications on minimum-bias 
measurements at the LHC. Since the proton-proton interaction generator in dpmjetIII is identical to phojet we will use 
the latter in our comparisons. 

Providing a description of the physics implemented in these models would be beyond the scope of this work. Here 
we will rather recall only the basic concepts behind the modehng particle production. 

2.1. Modeling of central multiparticle production 

The inclusive production of particles in high-energy hadronic collisions receives contributions from "soft" and 
"hard" interactions between the partonic constituent^ of the colliding hadrons. Soft (resp. hard) processes involve 
mainly f-channel partons of virtualities typically below (resp. above) a scale of a few GeV^. Soft scatterings 
give rise to production of hadrons with low transverse momenta and dominate hadronic collisions at low energies 
( < 20 GeV). 

Although soft processes have a virtuality scale not far from Aqcd ~ 0.2 GeV and thus cannot be treated within 
pQCD, predictions based on basic quantum field-theory principles such as unitarity and analyticity of scattering am- 



plitudes, as implemented in the Gribov's Reggeon Field Theory (RET) H63I1 . give a decent account of their cross 
sections in terms of the exchange of virtual quasi-particle states (Pomerons and Reggeons). At increasing energies, 
the Pomeron contribution, identified perturbatively with a multi-gluon exchange, dominates over those from sec- 
ondary Reggeons (virtual mesons). In particular, the soft Pomeron exchange, of colourless nature, is responsible for 
particle production in peripheral hadronic collisions and, in particular, for diffractive dissociation which accounts for 
a noticeable fraction of the total inelastic cross section all the way up to asymptotic energies. 

(Semi)hard parton-parton scatterings dominate the inelastic hadron production cross-sections for cm. energies 
above a few hundreds of GeV. Hard processes with large \q^\ » J^qcd treated within pQCD in a collinear- 

factorized approach where one sums up Eeynman diagrams of the underlying parton-parton (quarks and gluons) scat- 
terings, which give the leading logarithmic contribution, with each power of the strong coupling constant a^iq^) being 
multiplied by a collinear logarithm log q^. The scattered quarks and gluons produce then collimated bunches of final- 
state hadrons (jets) in a branching process dominated by perturb ative parton splittings described by the Dokshitzer- 



per 

Gribov-Lipatov-Altarelli-Parisi (DGLAP) equations (64,|65, 66t], and by non-perturbative hadronization when the 



parton virtuality is below 0(1 GeV). The RET approach can also be generalized to include hard processes via "cut 
(hard) Pomerons" diagrams. The physical picture is that of a colour flux tube, which is in the simplest case treated as 
a classical string that subsequently fragments into hadrons. 



Strictly speaking one cannot refer to individual partons in case of soft interactions because of confinement. Still it is a practical approach taken 
in all event generators to extend some of the concepts of hard interactions to the soft domain. 
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Historically, Monte Carlo (MC) event generators of high-energy hadronic collisions have evolved either starting 
up from the RFT approach, exemplified e.g. in the original Dual Parton Model (DPM) ll67ll . extended with a leading- 
logarithmic pQCD description for high-Pt production (based on cut-Pomerons) - such as in the phojet 



qgsjetOI and II 11491 |51ll52l|53|], sibyll |154|,|551|56|1, neXus 1168!, |69|], epos |162|] and dpmjet (60116111 cases - or they 
started from a purely collinear-factorized framework - such as in e.g. general-purpose MCs like pythia - com- 
plemented with an add-on model for truly soft Ii70i1 and diiTractive \J1] scatterings. Thus, on the one hand, the RFT 
approaches try to extend a consistent framework based on Pomeron degrees of freedom to the hard regime. On the 
other, the collider MCs contain a description based on partonic degrees of freedom (with scattering cross sections 
dumped in the infrared, below a "tunable" semihard scale) with soft and diffractive scatterings incorporated in a more 
or less ad hoc way. In both approaches the final non-perturbative transition of partons to hadrons is modeled based 



on the ideas of the Lund string fragmentation model ll72ll . At increasingly higher ^fs, in both frameworks one has to 
account for multiple scattering processes between the colliding hadrons, namely one has to include multi-Pomeron 
exchanges and/or multiple hard scattering processes. 

In the RFT framework, the single Pomeron (P) exchange amplitude is characterized by a power-like energy de- 
pendence, f^(s, t) cc with the Pomeron intercept ap(Q) ~ 1.1 leading to a corresponding energy rise of the total 
cross section cTiot - ^ lmf^(s, 0), which asymptotically violates the so-called Froissart bound (cr,o, < c log^ s) [Tstl- 
Accounting for eikonal multi-Pomeron exchanges, the cross sections are unitarized, i.e. cr(o, jnei oc log^ s, although due 



to the Abramovskii-Gribov-Kancheli (AGK) cancellations 07411 such multi-Pomeron configurations give zero contri- 
bution to inclusive particle spectra. Thus, the total soft charged particle density produced at midrapidity follows the 
energy-dependence defined by a single Pomeron exchange contribution: 



dNch(s, if) 



drj 



lmf{s,Q) 



°" inel. X ~r^' withAp = Q'p(0)-l -0.1. (1) 

,,=0 s o-™'(i) log^ s 



In pure DGLAP-based models, the central pseudo-rapidity particle density is proportional to the inclusive jet cross 
section which is given by the convolution of parton distributions functions (PDFs) and parton-parton scattering cross 
sections: 

<C^Go) = fdxidx2 [dpi ^ fyp(xupl)fj,p(x2,pl)x '''^''^yi''^'-^ &is-Apl). (2) 

The hard cross section is divergent in the limit — > and one needs to introduce a -cutoff to indicate the 
regime of validity of the perturbative approximation. At increasingly larger cm. energies, one needs to account 
for multi-parton scatterings and saturation effects. On the one hand, the cross section predicted by the regularized 
processes exceeds the total inelastic cross section, indicating that several (or multiple) hard scatterings occur per 
collision. On the other, for decreasing but still perturbative values, the integrals receive major contributions from 
the region of low parton fractional momenta (x = p ,,„„„„/ Pi,„d,„,X where the dominant gluon distribution rises roughly as 
fglp{x,p]) ~ jc^'^iia.d wjtij Ahard - 0.3. After integrating above the p^-cutoff Qo, one obtains an energy-dependence of 
the corresponding hard central charged hadron densities of the type 



dNch(s, rf) 



drj 



„=o <'(*) at log^ s 



with Ahard ~ 0.3. (3) 



Clearly, the fast growth of the gluon densities at low x results in the hard part of the particle density (oc s^h^J, Ahard ~ 
0.3) to rise with energy much faster than for soft processes (oc s^', Ap ~ 0.1). However, at sufficiently small x, the 
number of gluons is so large that new parton multiscattering phenomena have to be accounted for. First, non-linear 
{gg fusion) effects become important in the PDFs themselves, saturating their growth as x ^ iItsIi . The strength 
of these effects is controlled by the "saturation scale" Q^^^ at which parton branching and fusion processes start to 
compensate each other. Second, the probability to have simultaneous scatterings of the constituents of the colliding 
hadrons also increases leading to multiple parton interactions (MPI) in a single collision. In many MC generators one 
effectively mimics saturation effects by introducing some energy dependence to the infrared pj^-cutoff: - Q^{s). 
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In particular, choosing Qq{s) ~ as done e.g. in pythia, the power-law changes as 



dNchis, if) 



drj 



(4) 



'7=0 



which reduces the relative role of the hard contributions compared to the "linear" predictions for fixed Qo cutoff, Eq. 
Q. [We note that stricto sensu in pythia there is extra particle production from "purely" soft scatterings iItoIi which 
is not accounted for by this simple expression.] 



Not only the particle multiplicities but also their transverse momenta are sensitive to the underlying parton dynam- 
ics. In soft processes, the produced hadrons have typical transverse momenta in the sub-GeV range {if°^^ ~ 0.3 - 0.4 
GeV), without a pronounced energy-dependence for the average p^. For the hard contribution, due to the fast drop 
(oc of the differential jet production cross-section, the peak of the perturbative production sits at gluons whose 

transverse momentum is close to the cutoff, ~ Qo, producing jets of a few GeV which are often called "minijets" 
since they are perturbatively calculable but often not experimentally observable over the soft hadronic background. 
Due to the faster energy rise of the hard contribution compared to the soft one - cf. Eqs. ([T]l and (O - average hadron 
transverse momenta rise with ^/s from p^^ ~ p'f^^ to p^^ ~ Qo- As a matter of fact, the energy-dependence of the 
average p^ is expected to be even steeper when parton saturation effects come into play - due to the suppression of 
semi-hard parton production in the "dense" saturated region, ^ p]_ < Q^^^ix) and due to the rise of the saturation 
scale 2sat log(l/x) oc log( yfs) itself. The effective value of {Q^^^) controls thus the mean transverse momentum 
{p±} of a significant part of the finally produced hadrons after parton fragmentation. It is noteworthy that the MC pro- 
cedure of mimicking the saturation and MPI effects via an energy-dependent 7?^ -cutoff changes somehow this picture. 
Having the soft part of the interaction unchanged and reducing the minijet contribution - cf. Eqs. © and (O - the 
p±-nse from p'^f^ to Qo{s) proceeds slower than in the linear case, up to very high energies. 



All in all, it is clear that both the total particle pseudorapidity density dNch/dT]\^=o and the average pj^ of produced 
hadrons are sensitive to the soft and semi-hard dynamics, to the non-linear parton (saturation) effects, and to their 
implementations in the models of high-energy hadronic scattering. 



2.2. The PYTfflA event generator 

The basic ingredients of the pythia event generator are leading-order pQCD 2 — > 2 matrix elements, augmented 
by initial- and final-state parton showers (ISR and FSR, respectively), folded with parton distribution functions (in- 
terfaced here via the lhapdf v5.8.2 package Fldk ) on the initial-state and the Lund string model |72| to describe 



the final parton-to-hadron fragmentation. The infrared l/p]^ divergence of the hard (multi)parton cross sections is 
regularized by a cutoff Qo, such that l/p^ — > l/{p]_ + Q^)^- The infrared cutoff depends on the hadron-hadron 
cm. energy: Q^{s) - 2y(io) ■ {sj sqY, where !2o('So) is a reference value at a given cm. energy yfsQ, e.g. Qq~2 GeV 



at ^fsQ = 1.8 TeV is preferred by the Tevatron data 11771 17811 . Other non-perturbative ingredients of pythia include a 



Regge-based modeling of diffractive processes 117111 . plus a model for the underlying-event (UE) issuing from multi- 
parton interactions (MPI), soft scatterings, and beam-remnants iItoIi . Multiple parton collisions are treated "perturba- 
tively" - i.e. based on the eikonalization of hard scattering contributions - but require a non-perturbative ansatz for 
the impact-parameter profile of the colliding hadrons. 

In this work we use the pythia event generator in two flavours: the Fortran version 6.422 (^\ (Nov. 2009), as well 
as the C++ version pythia 8. 130 |79]. Both codes include the newest description of multiple parton interactions based 
on the concept of "interleaved evolution" given by -ordered showers jsoilsill which accounts well for correlations 
between the hard-scattering and the underlying-event components seen in the Tevatron data. Compared to pythia 6.4, 
the MPI, ISR and FSR in pythia 8 are all interleaved, and the treatment of diffraction has improved: a diffractive 
system is viewed as a Pomeron-proton collision, including hard scatterings subject to all the same ISR/FSR and MPI 
dynamics as for a "normal" parton-parton process. 
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We consider three different "tunes" of the non-perturbative ingredients (IS and FS showering, UE, beam-remnants, 
FS colour-reconnection, and hadronization) of the two versions of pythia. For pythia 6.4 we selected (via the PYTUNES 
switch) the so-called Perugia-0 (320) iHil and Atlas-CSC (306) JUIH] tunes. Comparisons of LHC data to other 
existing pythia 6 tunes can be found in the literature (see e.g. jSoU ). For pythia 8 we use the default "tune 1". The 
chosen settings describe one or more of the minimum-bias (MB), underlying-event (UE), and/or Drell-Yan (DY) data 
sets at the Tevatron [87., 82i , 18311 . and are reasonably complementary on a number of important points, as illustrated in 
TablelU 



Version 


Tuning 


Diffraction 


Qo cutoff' at 


Qo scaling 


PDF 


Proton 


FS colour 


Exp. constraints 




(PYTUNES) 




V:^^= 1.8 TeV 


power e 




profile 


reconnection 


(Tevatron) 


6.422 


Perugia (320) 


simple 


2.0 GeV 


0.13 


CTEQ5L 


ExpOfPower 


weak 


UE, MB, DY 


6.422 


Atlas-CSC (306) 


simple 


1.9 GeV 


0.11 


CTEQ6L 


double-Gauss 


weak 


MB.DY 


8.130 


Tune 1 


improved 


2.15 GeV 


0.08 


CTEQ5L 


double-Gauss 


weak 


MB 



Table 1 : Comparison of the various ingredients controlling the non-perturbative and semi-hard dynamics in the three pythia models used in this 
work. See text for details. 



Since the three tunes share the same MPI dynamics, the single most important parameter to extrapolate the hadron 
multiplicity from Tevatron to LHC energies is the exponent e which controls the dependence of the MPI infrared 
cutoff scale on the collision energy, see Eq. (HJi. Given that a single value of Qq is used to regularize both the hard 
scattering and the MPI in the event, and that the generation of additional parton-parton interactions in the underlying 
event is suppressed below Qq, a higher scaling power of the infrared cut-off implies a slower increase of the overall 
hadronic activity. The default ("untuned") pythia 6.4 energy scaling e — 0.08, was chosen to follow the scaling of 
the total cross section (Ap ~ 0.08). However, most of the subsequent tunes needed to significantly increase the scaling 
power value to e a; 0.13, in order to find a better agreement with the V^-dependence of the UE and MB measured 



from SppS and Tevatron 1182118311 . One of the first conclusions from the MB LHC data, as we will see next, is that this 



exponent needs to be lowered again. 

For the initial-state Atlas-CSC uses the CTEQ6L parton densities I'ss'] plus a double-Gaussian transverse distri- 
bution for the partons in the proton, whereas Perugia uses the default CTEQ5L PDFs 1, 8 9,1 and a smoother proton 
form-factor (exponential-of-power, exp(-r"), with exponent n - 1.7) which decreases the multiplicity fluctuations. 
The Tune-1 of pythia 8 is closer to Atlas-CSC in terms of the energy-scaling of the pQCD cut-off (it features the 
smallest exponent, s = 0.08, of the three models) and of the proton profile but uses the default CTEQ5L PDFs. For the 
final-state, the three tunes have weak final-state colour reconnections, which increase the final particle multiplicities, 
compared to models with stronger colour correlations. 



2.3. Models based on Reggeon Field Theory (RFT) 

The RFT-based MCs considered here start off from a construction of the hadron-hadron elastic scattering ampli- 
tude to determine the total and elastic cross sections. Experimental data allow one to fix the basic model parameters, 
such as the intercept and slope of the Pomeron Regge trajectory, the Pomeron-hadron couplings, etc. The inelastic 
events are then understood in terms of cut Pomerons which correspond to colour flux tubes (treated as strings) ex- 
tended between constituent partons of the colliding hadrons. Final hadrons are produced in the fragmentation of the 
strings. The different models used in this work (Table |2]i differ in various approximations for the collision configu- 
rations (e.g. the distributions for the number of cut Pomerons and for the energy-momentum partition among them), 
the treatment of diffractive and perturbative contributions as well as of high parton density effects, and the details of 
particle production from string fragmentation. 



-Parameter PARP(90) = 2 ■ £ = 0.16. 
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Model (version) 


Diffraction 


go cutoff 


Saturation 


Other 






(GeV) 


effects 




PHOJET 1.12 [52, 58j 


2-channel eikonal 


2.5 


via running Qo{s) 


FS+hadronization via pythia 6.1 15 


qgsjetOI [49J 


quasi-eikonal 


2.0 


none (flat low-x PDFs) 




SIBYLL2.1 r54. 55. 561 


2-channel eikonal 


1.0 


via running Qo{s) 




qgsjetII [51. 52. 531 


cut enhanced graphs 


1.6 


enhanced P-graphs 




EPOS V 1.99 [62] 


difFractive Pomeron 


2.0 


parametrized 


final-state collective flow 



Table 2: Comparison of the main ingredients controlling the non-perturbative and semi-hard dynamics present in the RFT-based event generators 
used in this work. 



Whereas the Regge-Gribov approximation is applied to hadrons as interacting objects in the case of sibyll, qgsjet 
and PHOJET, it is extended to include partonic constituents in epos. In the latter case, this allows the implementation of 
energy sharing between the different constituents of a hadron at amplitude level and suppresses final states with very 
large particle multiplicity. 



Except in epos, the basic implementation of diffraction dissociation follows the Good-Walker formalism |19C 
where the colliding hadrons are represented by superpositions of elastic scattering eigenstates which undergo different 
absorption during the collision. In all these models one uses two eigenstates per hadron (2-channel eikonal approach); 
in qgsjetOI the second eigenstate is "passive" (has zero coupling to Pomerons) which leads to the so-called quasi- 
eikonal approach [911 . While in sibyll v2.1 this mechanism is associated with high-mass diffraction, in the others it 
is restricted to the production of low-mass diffractive states. High-mass diffraction is treated empirically in qgsjetOI, 
accounting for single diffraction only. It is based on diffractive cuts of triple-Pomeron and Pomeron-loop graphs in 
PHOJET, and it is based on all-order resummation of cut enhanced P-graphs in qgsjetII. In epos vl.99, a special kind 
of Pomeron is used to define a diffractive event. Depending on each event configuration it can be non-diffractive, low 
mass diffraction without central particle production, or high mass diffraction. In all the cases, each initial hadron can 
be in an excited state. 



All the models account for multiple soft and (semi)hard production processes via multi-Pomeron interactions. 
The effects of high parton density at small x and the treatment of the hadronic remnants are implemented differently 
in the different generators, sibyll uses an energy-dependent transverse momentum cutoff for minijet production, 
2o(*) ~ Qo + exp (log s), based on the geometric criterion that there cannot be more gluons in a hadron than would 
fit in a given transverse area [92]. This cutoff is independent of the impact parameter of the collision, phojet uses 
also a "v/i-dependent cutoff, of the form Qois) ~ Qo + C ■ log( ^fs). qgsjetOI is based on old parton densities of the 
pre-HERA era and the (artificial) flatness of low-x PDFs, xg(x, Q^) cc const, allows one to partly avoid the problem 
of high parton densities. This issue is addressed more realistically in the successor model, qgsjetII, which is based 
on a re-summation of enhanced diagrams corresponding to Pomeron-Pomeron interactions ll5lll93il94ll . which lead 
to impact-parameter and density-dependent parton saturation for soft processes 19511 . In epos, non-linear effects are 
introduced phenomenologically via the elastic and inelastic fusion of parton-ladders. The elastic process provides 
screening, therefore a reduction of total and inelastic cross sections. The inelastic process affects particle production, 
in particular transverse momentum spectra. Finally, in contrast to other MCs, the epos version 1.99 used in this paper 
includes also a parametrizecH collective expansion in the final state for all colhding systems including p-p at LHC 
energies. 



The latest EPOS developments (epos2) I96ll97ll include a full hydrodynamical evolution. 
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3. Experimental data and event selection corrections 

Proton-(anti)proton interactions at colliders can be roughly divided into three categories according to the overall 
event topology: elastic, difFractive and non-difFractive processes. Elastic events are usually missed by the experiments 
unless specific detectors (Roman Pots) are installed inside the tunnel |98]. Inelastic interactions are often collected 
with a generic minimum bias (MB) trigger that accepts a large fraction of the particle production cross section by 
requiring a minimum activity in one or various detectorfl Often one requires a two-arm trigger with particles in 
opposite hemispheres to eliminate backgrounds mostly from beam-gas collisions and from cosmic -rays. Such a trig- 
ger selection, however, reduces significantly the detection rate of (single) diffractive collisions characterized by the 
survival of one of the colliding (anti)protons and particle production in just one hemisphere. The associated measure- 
ments are often dubbed "non single-difFractive" (NSD), although the corresponding cross sections are obtained with 
some level of model-dependence as diff'erent event-generators have different descriptions of difFractive production. 
The UA5 experiment triggered only on one arm to measure the "single-difFractive" (SD) events and by combining the 
SD and the NSD results published MB results very close to a true inelastic trigger (INEL) (using MC corrections in 
both cases). LHC experiments define their inelastic trigger in difFerent ways. 

The list oF minimum-bias (non-single difFractive and inelastic) event-selection triggers that have been employed 
by the difFerent experiments presented in this work are listed in Table [3] Since in most cases, and in particular at 
the LHC, the experimental MB-trigger corrections have been obtained using pythia and phojet, we can compare the 
MC-truth results From these two generators directly to the data. For the RFT models, we will use the predictions 
obtained both by using a theoretical definition oF an NSD event as well as by applying directly the experimental cuts 
at the hadron-level to the final states produced in the MC simulation. This will allow us to estimate the efFect oF the 
trigger definitions on the measured hadron distributions and to investigate the model dependence oF the experimental 
event-selection corrections. 



Experiment 


Collision 


Non-single difFractive (NSD) 


Inelastic (INEL) 








trigger 


trigger 


ISR 




p-p 




Cross section based 


30- 


65 GeV 




on luminosity 


UAl 




p-p 


1 or more charged particles at each 




0.2- 


- 0.9 TeV 


opposite rapidities (L5 < I77I < 5.5) 




UA5 




p-p 


1 or more charged particles at each 


1 or more charged particles 


0.2- 


- 0.9 TeV 


opposite rapidities (2.0 < I77I < 5.6) 


within 2.0 < I77I < 5.6 


CDF 




P-P 


1 or more charged particles at each 




1.' 


96 TeV 


opposite rapidities (3.2 < // < 5.9) 




ALICE 


0.9 


P-P 
-7 TeV 


1 or more tracks at each opposite 
rapidities (2.8 < t] < 5.1, -L7 < 77 < -3.7) 


1 or more hits at |?7| < 2 
OR 2.8 < 77 < 5.1 OR -1.7 <j]< -3.7 


CMS 




p-p 


1 or more particles {Ej > 3 GeV) at each 


1 or more charged particle(s) 


0.9 


-7 TeV 


opposite rapidities (2.9 < It/I < 5.2) 


in the central region \r]\ < 2.0 



Table 3: List of hadron-level cuts implemented in the "non-single diffractive" (NSD) and "inelastic" (INEL) triggers used by various proton- 
(anti)proton collider experiments. 



"^True "zero bias" triggers, which only require the passing by of the two beams by the interaction point, are more sensitive to instrumental 
backgrounds. 
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Primary charged hadrons are defined as all charged hadrons produced in the collision, including the products 
of strong and electromagnetic decays, but excluding products of weak decays. Feed-down corrections from weakly 
decaying strange resonances (mainly Kg, A, A and E) have to be accounted for in order to obtain the final hadron 
spectrum. Such corrections, which depend on the strange particle composition in the MC, reduce by about ~8% 
the total charged yield at midrapidity. In all the simulations, one takes this into account by decaying all unstable 
particles for whichOcr < 10 mm. The sole contribution from charged leptons to the reconstructed tracks in the \ow-p± 
range, comes from the Dalitz ^r" decay amounting to about 1.5% of the charged yield. ALICE does not correct for 
this contribution, whereas CMS does. We have removed this small contribution from all our model predictions by 
counting only the produced charged hadrons. 



4. Data versus models 

4.1. Particle pseudorapidity densities 

The pseudorapidity densities, dNchldrj, of charged hadrons measured in NSD collisions at the LHC (0.9, 2.36 
and 7.0 TeV) by ALICE and CMS (as well as by UA5 at 900 GeV) are shown in Fig. |2]compared to two pythia 6.4 
tunes, PYTHIA 8 and to phojet. In the pythia case, the NSD predictions are obtained switching off" the single-diffractive 
contribution^ without any hadron-level trigger. Since the eff'ects of the LHC MB-selections have been corrected for 
by the experiments themselves using pythia (and phojet as a cross-check), this is a consistent comparison. 



z 6- 



pp -> h', /s = 900 GcV 
c CMS (NSD) 
A ALICE (NSD) 
□ UA5 (NSD) 




PYTHIA 6.422 (Atlas-CSC) 
■ PYTHIA 6.422 (Perugia-0) 
. PYTHIA8.130(Tune-1) 
. PHOJET 1.12 (+PYTHIA6.11) 

, , \ , , , \ ^ 
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Figure 2: Pseudorapidity distributions of charged hadrons, h* = {h^ + h ), measured in NSD p-p events at the LHC ( ^fs = 0.9, 2.36 and 7 TeV) 
by ALICE (HH^l and CMS (MUll (and by UA5 (Hi in p-p at 900 GeV) compared to three different versions of PYTtnA and to the phojet MC. 
The dashed band is the systematic uncertainty of the CMS experiment which is similar to those of the two other measurements. 



The Perugia-0 tune underpredicts all the measured midrapidity densities - by about 20% at V* = 0-9 and 2.36 TeV 
and by about 35% at 7 TeV - whereas the Atlas-CSC tune overpredicts by 10% the central multiplicities at 0.9 TeV 
but reproduces well the data at higher cm. energies, pythia 8 is 15% (10%) below the experimental central densities 
at ^/s = 0.9 and 2.36 TeV but agrees well with the experimental shape and normalization at 7 TeV. phojet is compat- 
ible within uncertainties with the measured hadron dNch/drj distributions at 0.9 and 2.36 TeV, whereas at 7 TeV it is 
some 15% lower. The Perugia tunes were obtained mostly with Tevatron non-diffractive processes using hadrons with 
Pj_ > 0.4 GeV/c, which affects their prediction accuracy for the lowest multiplicities and lower transverse momenta 
considered here (the ALICE and CMS experimental hadron distributions are measured from ~ 100 MeV/c and ex- 
trapolated down to zero p±). The better agreement of the Atlas-CSC tune and pythia 8 with the observations is linked 
to the faster rise of particle production with yfs, as given by their smaller exponents, e = 0.1 1 and 0.08 respectively. 



''MSTJ(22)=2,PARJ(71)=10 in PYTHIA 6.4, and ParticleDecays:limitTauO = on, ParticleDecays : tauOMax = 10 in 

PYTHIA 8. 

*MSUB(92)=MSUB(93)=0 in pythia 6.4. Sof tQCD : singleDif f raction=of f in pythia 8. 
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Figure 3: Pseudorapidity distributions of cliarged liadrons, h- = (h* + h ), measured in NSD p-p events at the LHC (0.9, 2.36 and 7 TeV) by 
ALICE (HIH and CMS fHllll (and by UA5 (H in p-p at 900 GeV) compared to the predictions of qgsjetOI and II, sibyll, and epos. The 
dashed band is the systematic uncertainty of the CMS experiment which is similar to those of the two other measurements. 



see Eq. dU and Table [T] The differences among the tunes cannot be related to the treatment of final-state effects since 
all of them have hadronization parameters tuned to LEP data and a "weak" option for the FS colour reconnection 
mechanism, which thus does not further reduce the particle multiplicity. 

The comparison of the various cosmic-ray interaction generators to the NSD charged particle densities at the LHC 
is shown in Figure |3] In general the data-theory consistency is better than for pythia. At 900 GeV, all the RFT 
models are in a reasonable agreement with the measurements, because inclusive UA5 data were used for the model 
calibrations. With increasing energy, the differences among the models predictions increase, bracketing the data with a 
very good agreement for sibyll and qgsjetOI, and about H- 10% for qgsjetII and -20% for epos. Such a RFT- model-data 
comparison is not fully self-consistent because the NSD event-selection corrections applied by the LHC experiments 
have been obtained using pythia and phojet. We have carried out the same comparison but now applying the CMS 
NSD trigger (Table |3]l directly at the hadron-level in the simulations. In Table |4] we list the predicted fractions of 
single diffraction events, the efficiencies of the SD and NSD event selection, and the resulting corrections 

JM NSD-theor / j», NSD-CMS 

drj j drj 

for all the models considered and for different LHC energies. The results obtained with the RFT models using the 
theoretical and experimental triggers are practically identical, whereas pythia- and PHOiET-based corrections for the 
NSD event-selection resulted in about a 8% shift down of the measured dNch/dT]\^=o 



Model 
(TeV) 


qgsjetOI 
0.9 2.36 7 


qgsjetII 
0.9 2.36 7 


EPosl.99 
0.9 2.36 7 


sibyll2.1 
0.9 2.36 7 


SD / inel 
PP ' PP 

BSD 

snsd 
C 


0.18 0.18 0.18 
6.4% 6.0% 5.0% 
93% 95% 96% 
0.99 0.99 1.0 


0.23 0.21 0.19 

16% 19% 21% 
90% 93% 95% 
0.99 1.0 1.01 


0.17 0.17 0.16 

29% 32% 32% 
90% 92% 94% 
0.97 0.98 0.97 


0.23 0.20 0.17 

16% 22% 25% 
97% 98% 99% 
1.0 1.02 1.02 



Table 4: Fractions of single diffractive (SD) events, efficiencies of the SD (eso) and NSD (ensd) event-selections, and correction factors C, Eq. jsj. 
obtained applying the CMS NSD trigger at the hadron-level in the RFT models. 



The energy dependence of the charged hadron density at 77 = predicted by the different pythia models and by 
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PHOJET at cm. energies from = 10 GeV to ^fs - 20 TeV is presented in Fig. |4] compared to the existing NSD (left 
panel) and inelastic (right panel) data measured at SppS, Tevatron and LHC. We note that the NSD selection results in 
measured central densities which are about 15% larger than those obtained with the less-biased INEL trigger, which 
has less particles produced on average as it includes (most of) diffractive production. We notice also that the theoret- 
ical spread is smaller for the inelastic event-selection than for the NSD case, which indicates the different modelling 
of the non-diiTractive contributions in the various MCs. Up to yfs - 2.36 TeV, phojet gives the best overall agreement 
with the data. Perugia-0 fails to reproduce the pace of increase of the central particle densities at all the energies. The 
predictions of Atlas-CSC and pythia 8 globally bracket the experimental energy evolution of the hadron multiplicities 
at 77 = 0. 




Figure 4: Collision-energy dependence of the midrapidity charged hadron invariant yields in non single-dilfractive (NSD, left panel) and inelastic 
(right panel) p-p and p-p collisions compared to different tunes of pythia 6 and 8 and to phojet 1.12. 



In Fig.|5] the same data are compared to the RFT models used for air-shower simulations. The spread of model 
predictions at all cm. energies is smaller than that of the different pythia tunes. Although all RFT models reproduce 
globally well the pre-LHC data up to yfs x2 TeV, one sees different extrapolations at the current top LHC energy and 
beyond. Up to 7 TeV, the older models sibyll and qgsjetOI have better predictions for the average multiplicity than 
the newer qgsjetII and epos: The rate of the multiplicity rise between 900 GeV and 7 TeV compared to the measured 
one is 20% higher for qgsjetII and 15% smaller for epos. 

4.2. Average transverse momentum 

The end result of a MC prediction for a given MB observable is not controlled by one single physical effect, but by 
a combination of various ingredients implemented in the model, such as e.g. those listed in Tables[T]and|2] For exam- 
ple, in PYTHIA the multiplicity density can be increased by allowing more underlying-event activity, and decreased by 
allowing e.g. more colour reconnections. Hence the same final multiplicity can be obtained through different combi- 
nations of MC tunings. For example, since the underlying event pumps energy into the event, in order to maintain the 
same multiplicity distribution, the hardness of the hadron spectra must then be a function of the underlying activity. 
Thus, by combining constraints from the experimental particle and transverse-momentum flows, some extra discrimi- 
nating power can be gained. As discussed in the Introduction, the mean in a given p-p event is a sensitive probe of 
the semi-hard dynamics (multiparton interactions and saturation effects). In this section, we discuss the average (p^) 
predicted by the models in comparison to the collider data. 



12 




Figure 5: Collision-energy dependence of the midrapidity charged hadron invariant yields in non single-dift'ractive (NSD, left panel) and inelastic 
(right panel) p-p and p-p collisions compared to the predictions of qgsjetOI and II, sibyll, and epos. 



In the case of the pythia and phojet simulations we have computed (pj.) as done by CMS jssll . i.e. by fitting the 
midrapidity /7j^-differential charged hadron spectra with the Tsallis function llosll . and averaging the over that func- 
tion. For the RFT models we simply average the of all the charged particles in the central 77 range. Applying the 
NSD or full-inelastic selections does not change drastically the values of {p^^ which differ only by ~ 5%. Also, the 
exact pseudorapidity coverage of the measurement around midrapidity (e.g. IA77I < 1 or |A?/| < 2.5) does not change 
much the associated mean p^^ values (~ -4%) although an extension to full rapidities would decrease its value by about 
12%. 

The energy dependence of the average transverse momentum of charged hadrons measured from the ISR collider 
up to LHC energies is compared to the predictions of pythia and phojet (left panel) and of cosmic ray models (right 
panel) in Fig.|6] The phojet and epos results are globally in good agreement with the V^-dependence of the average 

seen in the data. The Atlas-CSC pythia tune and sibyll predict a slower rate of increase at LHC energies. On the 
contrary, the rate of the increase predicted by pythia Perugia-0 and by qgsjetII is compatible with the data but their 
absolute scale is higher by roughly 10% and 20% respectively. The pythia 8 and qgsjetOI predictions miss the shape 
and absolute magnitude of (p^) ( yfs). It is interesting to notice that the Atlas-CSC pythia tune which reproduced well 
the pseudorapidity distribution (Fig.|2]i predicts a too low value for the average while the Perugia-0 tune which 
has a too low multiplicity shows a too large {pa_). 



4.3. Multiplicity probability distributions 

The multiplicity distribution P{Nch), i-e. the probability to produce Nch charged hadrons in an event, is of special 
interest because it provides extra differential constraints on the internal details of the hadronic interaction models. 
The low multiplicity part is mostly dominated by the contributions from diffraction (and from single-cut Pomeron 
exchanges in the RFT approaches), whereas the tail of the distribution gives information on the relative contribution 
of multiparton scatterings (multi-Pomeron exchanges). The ALICE experiment has measured multiplicity distribu- 
tions within I77I < 1 using different triggers (inelastic, Tnel>0' with at least one particle measured in the considered 



77 range, and NSD) at 900 GeV, 2.36 TeV and 7 TeV IMlSTll- Such different tiiggers affect significantly the first 
few bin s of the distributions, where their maxima lie. The CMS collaboration has provided a higher statistics set of 
results ilOOll but applying a NSD trigger and, thus, with large uncertainties (up to 40%) in the low multiplicity part 
of the distributions. In Figs. |7] and [8j we show the P(M/,) probabilities for the ALICE Tnel>0' selection at the three 
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Figure 6: Average px of charged particles at midrapidity in p-p and p-p collisions as a function of compared to tlie pythia 6 and 8 and to phojet 
models (left panel) and to the predictions of qgsjetOI and II, sibyll, and epos (right panel). 



cm. energies compared to the corresponding results of pythia, phojet and RFT models. Figure|7]uses a log-scale to 
better visualize the behaviour at very large multiplicities. Figure[8]shows a zoom of P{Nch) in the low Nch < 20 region 
for PYTHIA and phojet (top panels) and for the RFT models (bottom panels). 



PYTHIA 8 and the Atlas-CSC tune of pythia 6 reproduce glob ally well the high multiplicity tail at 0.9, 2.36 and 
7 TeV whereas the Perugia-0 (as well as the D6T one 1371 llOOll ) tune predicts too few hadrons at all the energies. 



PHOJET is somehow in between: it reproduces the 0.9 and 2.36 TeV results but misses the tail at yfs - 7 TeV. In the low 
multiplicity region, Perugia-0 and pythia 8 predict too many hadrons, whereas Atlas-CSC tune and phojet are closer 
to the ALICE results. Since the modeling of diffraction is the same in Perugia-0 and Atlas-CSC, and it is similar 
in PYTHIA 8 and phojet, these results constrain well the concurrent role of diffractive scatterings and of multi-parton 
interactions implemented in all these MCs. 



In the case of RFT models, the high-A^^A tail is underestimated by epos and qgsjetOI, whereas sibyll and qgsjetII 
get a bit closer, sometimes overestimating the data. In the low-Nch region, P(Nch) ~ 4, only epos globally reproduces 
the experimental results whereas the rest of the models overestimate the measurements up to -1-30% for sibyll. The 
peak is even shifted towards lower multiplicity in the case of both qgsjet models. Thus, even if the average MB 
multiplicities at 0.9 and 2.36 TeV are well reproduced by most RFT models (Fig.O, the details of their probability 
distributions are missed and indicate possible paths for the improvement of the different model ingredients. For 
example, epos has a pretty good description of the low multiplicity part but does not produce enough high multiplicity 
events (with more than 50 particles) compared to the data. This explains why the average dNch/drj is too low for this 
model (see Fig. |3]l which seems to give a correct average multiplicity per cut-Pomeron but with too few Pomerons. 
The qgsjetII model underestimates the data at the low multiplicity end but has a much longer tail. One possible reason 
for this mismatch is that the model neglects hard Pomeron-Pomeron couplings, hence, can not describe a dynamical 
evolution of the saturation scale beyond the fixed Qq cutoff value. For sibyll, the simulations oscillate around the data 
giving a correct average value but the shape of the multiplicity distribution is not described correctly. So even if the 
average values of inclusive observables - such as the multiplicity and average transverse momentum - are predicted 
correctly within 10% by the RFT models, there is still room for improvement of the microscopic dynamics of hadronic 
interactions in order to safely extrapolate the predictions up to the highest cosmic-ray energies. 
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Figure 7: Multiplicity distributions of charged hadrons, P(Nch), measured by ALICE in lnel>0 p-p events at 0.9, 2.36 and 7 TeV (BtIi compared to 
the predictions of pythia 6 and 8 and of phojet (left plot) and of qgsjetOI and II, sibyll, and epos (right plot). 
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Figure 8: Low part of the multiplicity distributions of charged hadrons, P(Nch), measured by ALICE in lnel>0 events at 0.9, 2.36 and 7 TeV 137 
compared to the predictions of pythia 6 and 8 and of phojet (top panels) and of qgsietOI and II, sibyll, and epos (bottom panels). 
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4.4. Discussion 

The level of (dis)agreement between the three LHC inclusive hadron observables discussed in the previous sec- 
tions and each one of the eight hadronic MCs considered in this work, is summarized in Tables|5]and|6l As a general 
conclusion, none of the models can reproduce completely all the sets of measurements. However, some models need 
more retuning than others. For example, the tune Perugia-0 of pythia 6 does a poor job reproducing the minimum bias 
results measured at the LHC. As discussed previously, this is mainly due to the fact that the Perugia tunes were ob- 
tained mostly with Tevatron non-diffractive processes using hadrons with larger transverse momenta {p± > 0.4 Ge V/c) 
than those considered at the LHC. pythia 8 has a better description of the LHC data, thanks mostly to an improved 
implementation of diffractive scattering, but the default settings under Tune-1 are just a bona fide first guess. The 
Atlas-CSC tune does a better job in general but clearly cannot reproduce diffractive-enhanced data samples measured 
at the LHC (not shown here, see flOlil '). and thus it can be mostly considered as providing a useful reference for 
possible settings - in particular the power-law exponent s that regulates the cm. energy dependence of the infrared 
cutoff for (multi)parton scatterings - that should be tried with the more advanced pythia 8 code. As a matter of fact, 
recent developments in pythia 8.145 indic ate that it is possible to reproduce LHC and previous collider data with an 



updated set of parameters (Tune 4C) 1110211 . 
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Table 5 : Level of overall agreement between different PYTraA tunes and phojet 1.12 with inclusive charged hadron results measured in p- p collisions 
at 0.9, 2.36 and 7 TeV: pseudorapidity densities dNchldri\,j=o, mean transverse momentum (px), and multiplicity probabilities P(Nch) (for low and 
high values of Nek)- A tick (/) indicates a reasonable data-model agreement within experimental uncertainties, and 'over' ('under') that the MC 
tends to over (under) estimate the data. 



The RFT models (Table|6|l give an overall good description of the central pseudorapidity densities at LHC energies, 
but fail to describe consistently other characteristics of proton-proton collisions measured by the LHC experiments. 
Thus, further model improvements, particularly, those related to the treatment of inelastic diffraction and of the parton 
saturation mechanism, are highly desirable. 
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Table 6: Level of overall agreement between qgsietOI, qgsjetH, sibyll 2.1 and epos 1.99 with inclusive charged hadron results measured in p-p 
collisions at 0.9, 2.36 and 7 TeV: pseudorapidity densities dNch/dr]\rj=o, mean transverse momentum (p±), and multiplicity probabilities P(Nch) 
(for low and high values of Nd,)- A tick (/) indicates a reasonable data-model agreement within experimental uncertainties, and 'over' ('under') 
that the MC tends to over (under) estimate the data. 
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5. Implications for ultra-high energy cosmic-ray physics 



5.7. Extrapolations to ultra-high cosmic-ray energies 

In this section we show the Vi-evolution of the model predictions for the central rapidity density dNci,/dri\ij^o and 
the average transverse momentum (p^) for charged hadrons produced in p-p collisions. We present the predicted cm. 
energy dependence of the se observables up to the highest c osmic -ray energies. Both the High Resolution Fly's Eye 
(HiRes) experiment 10311 and the Pierre Auger Observatory lll04ll have measured UHECR up to energies of the order 
of 10^" eV. Beyond this energy, the CR flux from cosmologically distant sources (D > lOOMpc) is significantly re- 
duced due to the interactions of the primary cosmic -ray particles with the cosmic microwave background radiation on 
their way from the extragalactic sources to the Earth. This so-called Greisen-Zatsepin-Kuzmin (GZK) cutoff ll llll2[l 
corresponds to p-p cm. energies of order V^c^^ ~ 300 TeV, more than twenty times higher than those reachable at 
the CERN LHC. 



In Figs.|9]and[T0]we present the energy evolution predicted by all the models for dNchldTf\,^=Q up to GZK energies 
for NSD and inelastic p-p interactions respectively. The left panels show the predictions of the various pythia ver- 
sions considered and of phojet, and the right panels those of the rest of the RFT generators, phojet predictions stop 
at around a; 100 TeV, which is the maximum cm. energy where the model can be safely extrapolated to within its 
current implementation. 




Figure 9: Collision-energy dependence of the midrapidity charged hadron invariant yields in non single-diffractive (NSD) p-p collisions predicted 
by different tunes of pythia and by phojet (left panel) and by qgsjetOI and II, sibyll, and epos (right panel) MCs up to the GZK cutoff energies. 
The data points are the same as in Fig.|4](left). 



The first thing to notice is that all PYxmA tunes as well as phojet, qgsjetOI and epos feature, with different slopes, a 
power-law dependence c/A^/c/t; oc ( Y?)" - i.e. a linear behaviour in (log V^, logdNch/drjl^^o) scales - of the midrapidity 
particle-density. On the other hand, qgsjetII and sibyll show faster and slower dependencies on energy, respectively. 
Disregarding for the moment the pythia Perugia-0 and epos 1 .99 predictions, which are already lower than the particle 
multiplicities measured at 7 TeV, we see that the rest of the models can more or less reproduce the collider data up 
to LHC energies. Still they have very difi'erent extrapolations at the GZK energy range. The highest central rapidity 
densities at V*gzk predicted by qgsjetII which reaches dNchldT]\,pQ ^ 50 (off-scale in the plot), qgsjetOI and 
pythia 8 predict values of the order dNch/dT]\,j=o » 20, the Atlas-CSC pythia 6.4 tune indicates dNch/dr]\jj=o » 14, and 
finally sibyll (as epos) a low dNch/dr]\jj=o » 10 value at the GZK cutoff. 
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Figure 10: Collision-energy dependence of the midrapidity charged hadron invariant yields in inelastic p-p collisions predicted by different tunes 
of PYTHIA and by phojet (left panel) and by qgsjetOI and II, sibyll, and epos (right panel) MCs up to the GZK cutoff energies. The data points are 
the same as in Fig.|4](right). 



In Fig.[TT]we show the energy evolution of the average p± predicted by pythia 6 and 8 and by phojet (left panel) 
and by the cosmic-ray event generators (right panel). Above x 100 GeV, all the models show a power-law be- 
haviour (with varying exponents) of (p^) with collision energy. Interestingly, all the RFT MCs with the exception of 
EPOS predict a very moderate increase of {pj_) with energy: the {p±) amounts to ~0.6 GeV/c at GZK energies which 
is only 0.05 GeV/c above the current CMS result at 7 TeV. On the other hand, pythia 8, pythia 6.4 (Atlas-CSC) and 
EPOS indicate {p±)^^^ ~ 1 GeV/c (phojet, with an extrapolated {Pa.)^^^^ ~ 0.8 GeV/c, is somewhat in between). 

The different energy behaviour of the {p±) reflects directly the assumptions made in the models related to the low- 
X behaviour of the PDFs. For example, in models with saturation of parton densities, the mean transverse momentum 
of the produced hadrons is of the order of the saturation scale Qq in the high-energy limit. In the case of the two 
QGSJET models, the {pj_} does initially increase with energy and later approaches an asymptotic value. This behaviour 
is related to the fact that parton saturation is neglected in qgsjetOI while in qgsjetII there is no dynamical evolution 
of the saturation scale above the fixed Qo value. In contrast, assuming saturation also for highly virtual partons, 
saturation effects lead to a continuous increase of the average faster than log s, as found in phojet or pythia. epos 
would have had a behaviour similar to qgsjet, but because of the final-state collective parton expansion is taken into 
account in this model, the generated flow increases the average rapidly above ^fs ^20 TeV. 

5.2. Implications for the interpretation of extensive air showers 

Extensive air showers initiated by interactions of primary cosmic ray particles (protons and nuclei) with air nu- 
clei in the upper atmosphere constitute multi-step cascade processes. The backbone of an air shower is the hadronic 
cascade of interactions of both primary and secondary hadrons - (anti)nucleons, pions, and kaons - which quickly 
dissipates the initial energy between many hadronic particles and which pumps the energy into secondary electromag- 
netic (e.m.) cascades, mostly via decays of the produced neutral pions. In turn, high energy e.m. cascades proceed 
mainly via pair production of electrons and positrons from photons and by e"^ bremsstrahlung, with a relatively weak 
feedback to the hadronic cascade via photonuclear interactions. 

Experimental methods of EAS detection include measurement of charged particles arriving at ground level and 
studies of longitudinal shower development - mostly via observations of fluorescence and Cerenkov light produced 
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Figure 11: Collision-energy dependence of the midrapidity average px_ in non single-diffmctive (NSD) p-p and p-p collisions predicted by dilferent 
tunes of PYTHIA and by phojet (left panel) and by qgsjetOI and II, sibyll, and epos (right panel) MCs up to the GZK cutoff energies. The data points 
are the same as in Fig.|6] 



by charged particles (e'^ mainly) - see 110511 for a review of detection techniques. The most important shower observ- 
ables are the position of the shower maximum, X^ax, i-S- the depth in the atmosphere (in g/cm^) where the number of 
charged particles reaches its maximum, the number of particles at maximum A^max, and the number of electromagnetic 
particles (e^^, y) a nd m uons (yU^) at ground. Most of EAS characteristics measured at ground are closely related to 
^max and to A^max 1 106']. For example, the number of charged particles (mainly e'^) at ground A^^. strongly increases for 
deeper showers (where the maximal number of charged particles is reached close to the ground) and falls down for 
decreasing Xmax - due to the quick shower attenuation in the atmosphere. On the other hand, the number of muons A'^ 
at ground, which emerge mainly from charged pion and kaon decays, has a weak dependence on X,^^^, being instead 
sensitive to charged particle multiplicity of hadron-air and nucleus-air collisions. This explains the importance of the 
latter observable for CR composition studies with ground-based detectors. Finally it should be noted that the number 
of muons is also dependent on the physics of low-energy interactions in air showers. With pions typically decaying 
only after their en ergy i s red uced to ~20 GeV, low-energy interactions of cm. energy in the range of 10 - 50 GeV 



become important Ill07lll08ll . 



The relation between hadronic interactions at high energies and EAS observables has been studied numerically in 



detail in 112711 . Here we recall only the most important interaction parameters that determine the longitudinal develop- 
ment of air showers. The depth of shower maximum depends mainly on the characteristics of multiparticle production 
of the first few generations of hadronic interactions in a shower It is mainly related to the inelastic cross section of 
the primary incoming particle for interaction with air nuclei and on the corresponding energy fraction transferred to 
secondary particles but the most energetic ("leading") secondary particle, relative to the primary particle, which is 
called inelasticity. Additionally, although less strongly, X^ax depends on the inelastic cross sections and inelastici- 
ties for interactions of secondary hadrons with air nuclei. The third feature of direct relevance to the position of the 
shower maximum is the multiplicity of the primary and subsequent very high-energy interactions, which defines how 
the energy is distributed to secondary particles and corresponding sub-showers, i.e. whether many particles with small 
energy or few particles with large energy are produced. 

The LHC measurements of charged hadron pseudorapidity density and multiplicities presented here, have consid- 
erable importance for CR physics. First, the observations by the ATLAS, CMS and ALICE collaborations indicate 
that dN ch I dT]\ri=Q changes smoothly in the lab energy range from 10'^ to 3 x 10'^ eV, and are well bracketed, at the 
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+ 10% level at the highest energies, by the predictions of current interaction models used for EAS simulations. The 
bulk of hadron production data did not reveal serious deficiencies in the overall description of high energy hadronic 
collisions implemented in the event generators currently used in cosmic ray physics. This gives a strong support to the 
overall interpretation of the experimental results in terms of primary CR spectrum and nuclear mass composition in 
the knee energy range (10'^ -^ eV) obtained by various collaborations that applied these models for the corresponding 
data analysis |[109i,i2d,ll lOlll 1 ih . At the same time, none of the models is in perfect agreement with all the hadronic 
observables measured at the LHC (see Table|6|l, underlining the ne ed for re-tuning model parameters and reconsider- 
ing model assumptions. New versions of epos joF 



9711 and qgsjet 111 1211 are under development, partially influenced 



by the input from the LHC data. 



Second, the new experimental results strongly disfavour speculative ideas that the knee in the observed cosmic 
ray spectrum may be due to a sudden change in the hadronic interaction mechanism above 2 TeV cm. energy (see, 
for example. Indeed, any generic model that has about 20% of primary energy being transferred to 

particles that are not observed in EAS experiments just above 3 x 10'^ eV lab, would predict that proton showers of 
primary energy 3 x 10'^ eV, corresponding to about 7 TeV cm. energy, would have to be reconstructed with a 25% 
lower energy in order to attribute the knee in the spectrum solely to the production of such new exotic particles. Of 
course, so far only data from central LHC detectors have been published, covering just a small fraction of the energy 
released in final state particles. However, it is difficult to imagine how such a significant change of the interaction 
properties from 2 to 7 TeV cm. energy should only affect the particles close to the beam axis. More realistically, one 
has to include also nuclei as cosmic rays in the knee energy range, leading to even larger energy fractions that need to 
be channeled to undetected particles in hadronic interactions. 



Third, the first measurements at ^fs -1 TeV support a conventional extrapolation of the known features of multi- 
particle production to higher energies. Still it is not clear, whether the new data of the Pierre Auger Collaboration l:23ll . 
indicating a change to rather small shower-to-shower fluctuations that would be typical for a primary mass compo- 
sition dominated by medium or heavy elements, c ould not a lso be attributed to a change of the characteristics of 
hadronic interactions at energies above 3 x 10'** eV ill 13lll 1411 . But the current wide range of predictions for the par- 
ticle densities, dNci,/dri\,j=() ^ 10 (epos, sibyll) - 50 (qgsjetII), as well as for the mean hadron transverse momentum, 
{p±) ~ 0.6 (sibyll, qgsjetOI) - 1 (epos) GeV/c, justifies the concurrent use of various MCs to gauge the uncertainties 
connected to hadronic interaction models in the interpretation of the cosmic ray data at GZK-cutoff energies. 



Last but not least, the measurements of -differential spectra and mean pj_ of hadrons at LHC midrapidity do 
not have a direct impact on the interpretation of air shower data because the shape of lateral distributions of electrons 
(positrons) and muons at ground level is rather defined by multiple Coulomb scattering in the atm osphere and by 
the transverse momentum spectra of secondaries at much lower (fixed-target) energies il07l IIO8 I. Nevertheless, 
the corresponding results are of importance for testing the overall physics consistency of soft and hard interaction 
mechanisms implemented in the models as well as for the understanding of the parton saturation and multi-scattering 
phenomena discussed above. 



5.3. Importance of forthcoming LHC measurements 

The first multiplicity measurements at ^fs - 7 TeV (2.5 x 10"' eV in lab system) have put already serious con- 
straints on the interpretation of cosmic ray data. Coming p-p data at the nominal cm. energy of -y^ = 14 TeV, 
corresponding to cosmic-ray protons of 10'^ eV in the lab frame, will further reduce the uncertainty linked to the 
extrapolation of the interaction models to ultra-high energies, as they will the expected proton-nucleus runs {pPb 
collisions at ^^fs^ - 8.8 TeV) [35, 115]. First results on the dNch/drilj^o, and its centrality dependence, measured in 
Pb-Pb collisions at -sjs^ - 2.76 TeV 01 1611 will provide extra important cross-checks on the role of initial-state gluon 
saturation effects in the hadronic wave functions. 



There are many more measurements that can be done already at the current LHC energy to reduce the uncertain- 
ties of air shower predictions. Key information is expected, for example, from the measurement of the distribution 
of forward neutral hadrons (tt" and neutro ns) by LHCf 1 1 17] and possibly other experiments using zero degree and 
other forward calorimeters 1 118 . 1 19l 120ll in both p-p and Pb-Pb interactions. Of similar importance would be the 
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measurement of the energy flow and particle spectra in the very forward direction, in the pseudorapidity range from 
5 to 10. This is an angular range that is very difficult to access in collider experiments but it is parti ally c overed by 
various detectors in the LHC experiments 1 121], for example, TOTEM [98] and CASTOR (CMS) lfl22ll . Another 
fundamental input for tuning and extrapolating models are the total, elastic, and diffractive cross secti ons a ccessible 
to measurement thanks to various forward proton detectors (existing such as TOTEM, ATLAS ALFA [123], or pro- 
posed lll24ll ) in the LHC tunnel are a. Ex amples of the impact of the current uncertainty of the cross section on air 
shower observables are discussed in 11125 1. 
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Figure 12: Examples of midrapidity distributions whose measurement in p-p collisions at = 7TeV will help to refine the modehng of 
(anti)baryon production in current event generators. Left: Collision-energy dependence of the ratio of p to multiplicities. Right: Ratio of 
p to charged particle multiplicities as a function of the central pseudorapidity density. 

It is worth mentioning that other measurements that can be performed with central detectors at the LHC, such 
as e.g. of the rapidity spectra of identified hadrons, are of significant importance for cosmic ray physics. Indeed, 
due to diff'erent lifetimes of charged pions and kaons, knowledge of their relative contributions to the multiplicity 
of secondaries is required for precise calculations of the total number and of the energy spectra of muons produced 
during EAS development. Also the production of antibaryons is a theoretically poo rly understood aspect of hadronic 
interactions and data on baryon pair production are sparse. As demonstrated in 112611 . the muon content of air showers 
may be significantly enhanced if the rate of production of (anti)baryons is up to 30% higher than assumed in models 
like QGSJET and sibyll. Baryon-induced subshowers lead to a higher number of muons at ground than meson-induced 
ones. The energy dependence of the overall baryon production rate, its relation to the centrality of the collision, and 
the momentum distribution of the baryons are important quantities to be measured. This is illustrated in Fig. [T2] where 
CR model predictions for antiproton production in p-p at = 7 TeV are shown. The number of fragmenting strings 
leads to a so-called delayed energy threshold for baryon pair production. This threshold is lowest in sibyll, being well 
below the Tevatron energy, and largest in epos where it is in the energy range of the LHC (Fig. [T2] left). Also their 
predicted asymptotic limits are very diff'erent. The dependence of antiproton production on the midrapidity particle 
density in proton-proton collisions at the LHC is shown in Fig. [12] (right). The peak at low central multiplicities is 
related to the fact that antibaryons are pair-produced with baryons in a single string, consuming a large fraction of the 
energy stored in the corresponding string. Peripheral interactions are typically simulated with only two color-strings 
maximizing the energy per string, epos differ s not only quantitatively but also qualitatively in the antibaryon-baryon 



pair production model from the other models 11127 1 
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6. Summary 



We have compared first CERN LHC data on inclusive charged hadron production - central pseudorapidity densi- 
ties dNch/dr]\ri=a, multiplicity probabilities P(Nch), and mean transverse momentum {pj_) - with hadronic interaction 
models (qgsjetOI and II, sibyll, and epos 1.99) used for the simulation of extensive air showers generated by ultra- 
high-energy cosmic rays in the Earth atmosphere. We have also included in this comparison commonly used collider 
physics event generators, such as pythia (various tunes of version 6.4, and the new pythia 8) and phojet, in order to 
identify the model ingredients most sensitive to bulk hadron production. In all the cases we considered only those 
versions of the interaction models that were developed and tuned before the LHC data became available. 

In contrast to most Monte Carlo event generators for collider physics, models developed for air shower simula- 
tion are based on Reggeon Field Theory (RFT) and are optimized to predict the overall characteristics of the final 
state particles including those of predominantly soft interactions. Moreover, these models are designed to deal with 
proton-nucleus and nucleus-nucleus interactions and allow for an extrapolation to energies as high as ~ 400 TeV. 
Therefore the comparison to LHC data is an important benchmark for the quality of the models and hence the relia- 
biUty of air shower simulations currently used to interpret cosmic ray (CR) data. 

The quality of the LHC data description varies from model to model and differs for difl'erent observables. A 
first general observation is that none of the models considered provides a very good description of all the LHC data 
considered here. Yet, the CR models bracket the LHC central rapidity densities and the low region of the multi- 
pUcity probabiUties more naturally than most of the pythia tunes. The main difference appears to be on the more 
advanced treatment of diffractive scattering in the RFT models compared to the latter. The measured midrapidity 
charged-particle density is found to follow a simple power-law in energy, j"^, from 10 GeV up to 7 TeV cm. energy 
with exponent e « 0.10. This observation can be used to retune some basic model ingredients and improve their 
extrapolations to the highest energies. Such a result constrains in particular the way in which multiparton interactions 
and gluon saturation are implemented in various MCs (e.g. in PYxmA, phojet and sibyll), via an energy-dependent 
infrared transverse momentum cutoff, Qoi ^fs), for (multi)parton scatterings. 

The studies presented here have also shown that it is very difficult, if not impossible, to compare model predic- 
tions with data at the sub-5% level if the trigger used for data-taking cannot be implemented without a full detector 
simulation. It is highly desirable to have the LHC inelastic, minimum bias and diffractive data published with a simple 
hadron-level prescription for emulating the data trigger, to which the measurements have been corrected to. Only then 
we will be able to make best use of LHC results to lower the uncertainty of various non-perturbative model ingredients 
and to improve the description of the transition from soft to hard parton dynamics (multiparton interactions, gluon 
saturation, ...). 

With -\fs - 7 TeV, LHC is the first collider reaching an energy higher than the knee in the energy spectrum of 
cosmic rays. The LHC hadron multiphcity measurements give strong support to the conventional interpretation that 
the break in the power-law index of the observed CR spectrum at 10^^-^ eV is indeed due to a feature of the pri- 
mary cosmic ray flux. Alternative interpretations of the knee being a side-effect of rapidly changing properties of 
hadronic interactions above ~ 2 TeV are strongly disfavoured. Similarly the LHC measurements support the 
interpretation of air shower data in the knee energy range as reflecting a change from a light to a more heavy mass 
composition. No new or exotic physics assumptions or extrapolations are needed for describing the overall event 
features measured in the central pseudorapidity region at the LHC. While re-tuning of model parameters to match 
LHC data will improve the reliability of air shower simulations, there is no indication from the LHC results that the 
extrapolations have to be changed significantly. At the highest CR energies of (9(10^" eV) - i.e. more than twenty 
times higher than those cm. energies reachable in p-p at the LHC - the current wide range of predictions for the 
particle densities, dNchldri\ri=Q ~ 10 (epos, sibyll) - 50 (qgsjetII), as well as for the mean hadron transverse momen- 
tum, {p±} X 0.6 (sibyll, qgsjetOI) - 1 (epos) GeV/c, justify today the concurrent use of all RFT MCs to gauge the 
uncertainties linked to the underlying hadronic interactions in the interpretation of the cosmic ray data at GZK-cutoff 
energies. 
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Future measurements at the LHC will further cross-check interaction models and help to understand better the un- 
derlying hadron production processes. Of direct relevance to the interpretation of air shower data are the multiplicity 
and spectrum of produced high-energy particles (leading particles, neutral and charged) emitted at forward rapidi- 
ties, the contribution of different particle types to the overall final state multiplicity, the inelastic cross section and its 
breakdown in various diffractive contributions, as well as the production of baryon-antibaryon pairs at mid-rapidity in 
proton-proton, nucleus-nucleus and proton-nucleus collisions. 
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